Automatic Summarization (Mani) Book Review

نویسنده

  • Hal Daumé
چکیده

Researchers in automatic document summarization have already adopted many techniques from existing machine translation literature. Likewise, there is much that the machine translation community can learn from current research in summarization. Automatic Summarization, by Inderjeet Mani, provides a firm grounding in the primary techniques that have been applied to the summarization task, so that researchers or students unfamiliar with the topic will be sufficiently versed in its primary techniques to begin their own work. The book provides a logically structured, comprehensive survey of what the field of summarization was, up through the time of the book’s publication. Overall, Automatic Summarization provides a strong and effective comparison between techniques that professional human summarizers employ and those used by automatic summarization systems. This is one of the aspects that differentiates the book from Advances in Automatic Text Summarization, edited by Mani and Maybury. For a beginner in the summarization field, the introductory coverage of the various dimensions along which summarization systems can lie, and the coverage of basic definitions, is essential. The format is effective for readers of varying levels of sophistication: for students, the chapter conclusions help solidify the main points; for researchers, they provide a good reference into summarization literature. The book covers much ground: the content flows from simple extraction-based systems, through revision and discourse, to abstraction, multi-document summarization and multi-media summarization. This content is bookended by an introductory chapter on professional summarization and a concluding chapter on summary evaluation. The introductory chapter on professional summarization sets the framework for the remainder of the book, in which each task is considered, as it relates to the methods humans use for producing summaries. The automatic text summarization discussed ranges from simple single-document sentence extraction systems to systems that produce abstracts of document collections. The research presented in the chapters on extraction, revision and discourse has reached some reasonable level of stability, and accordingly, the chapter organization is easy to follow. The quality of the organization decreases in the chapters on abstraction, multi-document summarization and multi-media summarization; this reflects both that the later topics cover research that is, itself, inconclusive and that the topics are, themselves, less interrelated that those discussed in the first few chapters. The concluding chapter covers the problem of summary evaluation. Evaluation is largely an unsolved problem in summarization, due greatly to the fact that different humans typically create very different summaries of the same document. This pre-

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Systematic literature review of fuzzy logic based text summarization

Information Overloadrq  is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq    informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Advances in Automatic Text Summarization

It has been said for decades (if not centuries) that more and more information is becoming available and that tools are needed to handle it. Only recently, however, does it seem that a sufficient quantity of this information is electronically available to produce a widespread need for automatic summarization. Consequently, this research area has enjoyed a resurgence of interest in the past few ...

متن کامل

Automatic Text Summarization in TIPSTER

Automatic Text Summarization was added as a major research thrust of the TIPSTER program during TIPSTER Phase III, 1996-1998. It is a natural extension of the previously supported research efforts in Information Extraction (IE) and Information Retrieval (IR). There is considerable interest in automatically producing summaries due, in large part, to the growth of the Internet and the World Wide ...

متن کامل

Summarization Evaluation: An Overview

This paper provides an overview of different methods for evaluating automatic summarization systems. The challenges in evaluating summaries are characterized. Both intrinsic and extrinsic approaches are discussed. Methods for assessing informativeness and coherence are described. The advantages and disadvantages of specific methods are assessed, along with criteria for choosing among them. The ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004